Implementing TreadMarks over GM on Myrinet: Challenges, Design Experience, and Performance Evaluation
Abstract
Software-based DSM systems such as TreadMarks have traditionally performed poorly compared with message-passing applications because of the high communication overhead of traditional stack-based protocols such as UDP. Modern interconnects such as Myrinet offer reliable message delivery with very low communication overhead through user-level protocols. This paper examines the viability of implementing a thin communication substrate between TreadMarks and Myrinet GM, the rationale being that a layer tuned to the needs of the application would offer better performance and scalability than a generic UDP layer. Trade-offs among design alternatives for buffer management, connection setup, advance posting of descriptors, and asynchronous messages are discussed. We have implemented the best of these strategies in a layer that is bound to TreadMarks at compile time. Results from micro-benchmarks and applications show that the specialized implementation not only performs better but also exhibits better parallel speedup and scalability. A reduction in total application execution time of up to a factor of 6.3 on a 16-node system is demonstrated in comparison with the original implementation. The implementation also exhibits superior scaling properties as the application size is increased.
Similar resources
High Performance and Reliable NIC-Based Multicast over Myrinet/GM-2
Multicast is an important collective operation for parallel programs. Some Network Interface Cards (NICs), such as Myrinet, have programmable processors that can be programmed to support multicast. This paper proposes a high performance and reliable NIC-based multicast scheme, in which a NIC-based multisend mechanism is used to send multiple replicas of a message to different destinations, an...
Performance Evaluation of Gigabit Ethernet and Myrinet for System-Area-Networks
Low-latency, high-bandwidth networking is essential for cluster computing and System-Area-Networks (SAN). The performance of a SAN-optimized interconnect, Myrinet, is compared with gigabit Ethernet running TCP/IP. Though Myrinet has lower latencies and higher throughput than gigabit Ethernet, it is found that an efficient implementation of message passing interface library over TCP/IP achiev...
A Quantitative Study on the Communication Performance of Myrinet Network Interfaces
This paper presents a series of experiments for evaluating the communication performance of Myrinet network adaptors. In particular, this paper studies the impact that arises from the interactions between the Myrinet network adaptors and host computer systems on communications performance. We report and analyze the communication performances of GM, the low-level messaging API for Myrinet, and o...
Fast NIC-Based Barrier over Myrinet/GM
An efficient barrier implementation is desirable on parallel systems to obtain good parallel speedup and to support finer-grained computation. Some modern Network Interface Cards (NICs) have programmable processors which can be used to provide support for collective communications such as barrier. In this paper, we utilize such a programmable NIC to provide an efficient barrier synchronization ...
High-Performance Networking for Software DSMs
Several messaging software architectures (MSAs) have been proposed and implemented for high-performance local-area networks (LANs). Several of these MSAs have been successful at providing low latency and high bandwidth to user-level processes that communicate via explicit message passing. In this paper we claim that these MSAs are suboptimal for page-based software distributed shared-memory sys...
Publication date: 2003